Image dewarping and text extraction from mobile captured distinct documents
نویسندگان
چکیده
منابع مشابه
Image dewarping and text extraction from mobile captured distinct documents
Camera Based Document Analysis (CBDA) is an emerging field in computer vision and pattern recognition. In recent days, cameras are moulded with several items of additional equipment. Thus, they play a vital role in the replacement of scanners with hand-held imaging devices (HIDs) like digital cameras, mobile phones and gaming devices. Warping is a common appearance in camera captured document i...
متن کاملDewarping Book Page Spreads Captured with a Mobile Phone Camera
Capturing book images is more convenient with a mobile phone camera than with more specialized flat-bed scanners or 3D capture devices. We built an application for the iPhone 4S that captures a sequence of hi-res (8 MP) images of a page spread as the user sweeps the device across the book. To do the 3D dewarping, we implemented two algorithms: optical flow (OF) and structure from motion (SfM). ...
متن کاملAdaptive Information Extraction from Structured Text Documents
Effective analysis of structured documents may decide on management information systems performance. In the paper, an adaptive method of information extraction from structured text documents is considered. We assume that documents belong to thematic groups and that required set of information may be determined ”apriori”. The knowledge of document structure allows to indicate blocks, where certa...
متن کاملText Line Extraction from Complex Layout Documents
There are numerous stylish documents which do not have the traditional text layouts where printed text regions are not parallel to each other. Such complex layouts make text line extraction challenging due to multi-orientation of paragraphs. This paper introduces a system for the text line extraction from the complex layout documents. Proposed method is based on the concept of dilation and hist...
متن کاملDocument Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)
Document images produced by scanner or digital camera, usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Computer Science
سال: 2010
ISSN: 1877-0509
DOI: 10.1016/j.procs.2010.11.043